Ultra-Low-Power Voice Activity Detection System Using Level-Crossing Sampling

نویسندگان

چکیده

This paper presents an ultra-low-power voice activity detection (VAD) system to discriminate speech from non-speech parts of audio signals. The proposed VAD uses level-crossing sampling for detection. useless samples in the signal are eliminated due activity-dependent nature this scheme. A 40 ms moving window with a 30 overlap is exploited as feature extraction block, within which output analog-to-digital converter (LC-ADC) counted feature. only variable used distinguish and segments input number LC-ADC time window. achieves average 91.02% hit rate 82.64% over 12 noise types at ?5, 0, 5, 10 dB signal-to-noise ratios (SNR) TIMIT database. including LC-ADC, extraction, classification circuits was designed 0.18 µm CMOS technology. Post-layout simulation results show power consumption 394.6 nW silicon area 0.044 mm2, makes it suitable always-on device automatic recognition system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zero-Crossing-Based Ultra-Low-Power A/D Converters Citation

| Since the first demonstration of a comparatorbased switched-capacitor circuit, analog-to-digital (A/D) converters based on virtual ground detection have made steady and significant progress. Comparators have been replaced by zero-crossing detectors, leading to the development of zerocrossing based circuits for faster speed and lower power. All facets of performance including the sampling rate...

متن کامل

Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation

This paper proposes a novel hands-free voice activity detection (VAD) method utilizing not only temporal features but also spatial features, called adaptive zero crossing detection (AZCD), that uses talker direction estimation. It firstly estimates talker direction to extract two spatial features: spatial reliability and spatial variance, based on weighted cross-power spectrum phase analysis an...

متن کامل

Low frequency ultrasonic voice activity detection using convolutional neural networks

Low frequency ultrasonic mouth state detection uses reflected audio chirps from the face in the region of the mouth to determine lip state, whether open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible as well as unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment...

متن کامل

Smart Ultra Low Power Energy Harvesting System

Small embedded systems operating in unattended conditions do need to be perpetually powered if a truly pervasive paradigm is envisaged. Harvesting energy from the surrounding environment seems to be the best option. For that, a set of systems has been proposed featuring interesting solutions but not yet capable of overcoming some issues like performance and flexibility. The authors propose a no...

متن کامل

Processing of Non-Stationary Signal Using Level-Crossing Sampling

The spectral characteristics of multimedia signals typically vary with time. Preferably, the sampling density of them would comply with instantaneous bandwidth of signal. The paper discusses the level-crossing sampling principle, which provides such capability for analog-to-digital conversion. As the captured samples are spaced non-uniformly, the appropriate digital signal processing is require...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12040795